189-2007: Introducing a SAS® Macro for Doubly Robust Estimation

نویسندگان

  • Michele Jonsson
  • Weisen
چکیده

Estimation of the effect of a treatment or exposure with a causal interpretation from studies where exposure is not randomized may be biased if confounding and selection bias are not taken into appropriate account. Such adjustment for confounding is often carried out through regression modeling of the relationships among treatment, confounders, and outcome. Correct specification of the regression model is one of the most fundamental assumptions in statistical analysis. Even when all relevant confounders have been measured, an unbiased estimate of the treatment effect will be obtained only if the model itself reflects the true relationship among treatment, confounders, and the outcome. Outside of simulation studies, we can never know whether or not the model we have constructed includes all relevant confounders and accurately depicts those relationships. Doubly robust estimation of the effect of exposure on outcome combines inverse probability weighting by a propensity score with regression modeling in such a way that as long as either the propensity score model is correctly specified or the regression model is correctly specified the effect of the exposure on the outcome will be correctly estimated, assuming that there are no unmeasured confounders. While several authors have shown doubly-robust estimators to be powerful tools for modeling, they are not in common usage yet in part because they are difficult to implement. We have developed a simple SAS® macro for obtaining doubly robust estimates. We will present sample code and results from analyses of simulated data. INTRODUCTION Correct specification of the regression model is one of the most fundamental assumptions in statistical analysis. Even when all relevant confounders have been measured, an unbiased estimate will be obtained only if the model itself reflects the true relationship among treatment, confounders, and the outcome. Outside of simulation studies, we can never know whether or not the model we have constructed accurately depicts those relationships. So the correct specification of the regression model is typically an unverifiable assumption. Doubly robust (DR) estimation builds on the propensity score approach of Rosenbaum & Rubin (1983) and the inverse probability of weighting (IPW) approach of Robins and colleagues (Robins, 1998; Robins, 1998a; Robins, 1999; Robins, 1999a; Robins, Hernan, and Brumback, 2000). DR estimation combines inverse probability weighting by a propensity score with regression modeling of the relationship between covariates and outcome in such a way that as long as either the propensity score model or the regression model is correctly specified, the effect of the exposure on the outcome will be correctly estimated, assuming that there are no unmeasured confounders (Robins, Rotnitzky, and Zhao, 1994; Robins, 2000; van der Laan and Robins, 2003; Bang and Robins 2005). Specifically, one estimates the probability that a particular patient receives a given treatment as a function of that individual’s covariates (the propensity score). Each individual observation is then given a weight equal to the inverse of this propensity score to create two pseudopopulations of exposed and unexposed subjects that now represent what would have happened to the entire population under those two treatment conditions. Maximum likelihood regression is conducted within these pseudopopulations with adjustment for confounders and risk factors. Results from extensive simulations by Lunceford and Davidian (2004) as well as Bang and Robins (2005) confirm the theoretical properties of this estimator. MATHEMATICS OF DOUBLY ROBUST ESTIMATION We use the following notation: Y is the observed response or outcome, Z is a binary treatment (exposure) variable, and X represents a vector of baseline covariates. Y1 and Y0 are the counterfactual responses under treatment and no treatment, respectively. All of these variables are further subscripted by i for subjects i=1, . . ., n. In this example, the causal effect of interest is the difference in means if everyone in the population received treatment versus SAS Global Forum 2007 Statistics and Data Analysis

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Practice of Epidemiology Doubly Robust Estimation of Causal Effects

Doubly robust estimation combines a form of outcome regression with a model for the exposure (i.e., the propensity score) to estimate the causal effect of an exposure on an outcome. When used individually to estimate a causal effect, both outcome regression and propensity score methods are unbiased only if the statistical model is correctly specified. The doubly robust estimator combines these ...

متن کامل

On a closed-form doubly robust estimator of the adjusted odds ratio for a binary exposure.

Epidemiologic studies often aim to estimate the odds ratio for the association between a binary exposure and a binary disease outcome. Because confounding bias is of serious concern in observational studies, investigators typically estimate the adjusted odds ratio in a multivariate logistic regression which conditions on a large number of potential confounders. It is well known that modeling er...

متن کامل

A SAS macro for estimation of direct adjusted survival curves based on a stratified Cox regression model

Often in biomedical research the aim of a study is to compare the outcomes of several treatment arms while adjusting for multiple clinical prognostic factors. In this paper we focus on computation of the direct adjusted survival curves for different treatment groups based on an unstratified or a stratified Cox model. The estimators are constructed by taking the average of the individual predict...

متن کامل

Doubly Robust Estimation of Optimal Dynamic Treatment Regimes

We compare methods for estimating optimal dynamic decision rules from observational data, with particular focus on estimating the regret functions defined by Murphy (in J. R. Stat. Soc., Ser. B, Stat. Methodol. 65:331-355, 2003). We formulate a doubly robust version of the regret-regression approach of Almirall et al. (in Biometrics 66:131-139, 2010) and Henderson et al. (in Biometrics 66:1192-...

متن کامل

%lrasch mml: A SAS Macro for Marginal Maximum Likelihood Estimation in Longitudinal Polytomous Rasch Models

Item response theory models are often applied when a number items are used to measure a unidimensional latent variable. Originally proposed and used within educational research, they are also used when focus is on physical functioning or psychological wellbeing. Modern applications often need more general models, typically models for multidimensional latent variables or longitudinal models for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007